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CLAIMS: 



1 . A method of generating a language model (7) for a speech recognition system 

(1), characterized \ 

in that a first text corpus (10) is gradually reduced by one or various text 
corpus parts in dependence on text data of an application-specific second text corpus (11) and 

in that the values of the language model (7) are generated on the basis of the 
reduced first text corpus (12) is used. 



2. A methc 

corpus parts by which 
text corpus (10), in the 
evaluated. 



d as claimed in claim 1, characterized in that for determining the text 
ihe first text corpus (10) is reduced, unigram frequencies in the first 
educed first text corpus (12) and in the second text corpus (1 1) are 



3. A method Ws claimed in claim 2, characterized in that for determining the text 
corpus parts, by which thh first text corpus (10) in a first iteration step and accordingly in 
further iteration steps-is reduced, the following selection criterion is used: 

PAi \ X M) 

with N spez (xM) as the frequency of the M-gram x M in the second text corpus, p(x M ) as the 
M-gram probability derived from the frequency of the M-gram x M in the first training corpus 
and p A (x M ) as the M-graiA probability derived from the frequency of the M-gram x M in the 

first training corpus reducedlby the text corpus part A\. 

4. A method as ckimed in claim 3, characterized in that tri grams are used as a 
basis with M = 3 or bigrams with M = 2 or unigrams with M = 1 . 



5. A method as claimed in one of the claims 1 to 4, characterized in that a test 

text (15) is evaluated to determinathe end of the reduction of the first training corpus (10). 
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6. A method as claimed in claim 5 5 characterized in that the reduction of the first 

training corpus (10) is terminated when a certain perplexity value is reached or a certain 
OOV rate of the test text! especially when a minimum is reached. 



7. A method 

(1), characterized in that 
by one or various other 
an application-specific 
language model (7) are 



of generating a language model (7) for a speech recognition system 
text corpus part of a given first text corpus is gradually extended 
corpus parts of the first text corpus in dependence on text data of 
corpus to form a second text corpus and in that the values of the 
generated while the second text corpus is used. 



text ■ 



text 



8. A method of generating an acoustic model (6) for a speech recognition system 

(1), characterized 

in that acoustic training material representing a first number of speech 
utterances is gradually rec uced by training material parts representing individual speech 
utterances in dependence ■ m a second number of application-specific speech utterances and 

in that the '< eoustic references (8) of the acoustic model (6) are formed by 
means of the reduced acoi istic training material. 



9. A method c f generating an acoustic model (6) for a speech recognition system 

(1), characterized in that apart of given acoustic training material, which material represents 
a multitude of speech utter uices, is gradually extended by one or more other parts of the 
given acoustic training material and in that the acoustic references (8) of the acoustic model 
(6) are formed by means off the accumulated parts of the given acoustic training material. 



25 10. A speech recognition system comprising a language model generated in 

accordance with one of the qlaims 1 to 7 and/or an acoustic model generated in accordance 
with claim 8 or 9. 




